A Ranking Approach to Genomic Selection
نویسندگان
چکیده
BACKGROUND Genomic selection (GS) is a recent selective breeding method which uses predictive models based on whole-genome molecular markers. Until now, existing studies formulated GS as the problem of modeling an individual's breeding value for a particular trait of interest, i.e., as a regression problem. To assess predictive accuracy of the model, the Pearson correlation between observed and predicted trait values was used. CONTRIBUTIONS In this paper, we propose to formulate GS as the problem of ranking individuals according to their breeding value. Our proposed framework allows us to employ machine learning methods for ranking which had previously not been considered in the GS literature. To assess ranking accuracy of a model, we introduce a new measure originating from the information retrieval literature called normalized discounted cumulative gain (NDCG). NDCG rewards more strongly models which assign a high rank to individuals with high breeding value. Therefore, NDCG reflects a prerequisite objective in selective breeding: accurate selection of individuals with high breeding value. RESULTS We conducted a comparison of 10 existing regression methods and 3 new ranking methods on 6 datasets, consisting of 4 plant species and 25 traits. Our experimental results suggest that tree-based ensemble methods including McRank, Random Forests and Gradient Boosting Regression Trees achieve excellent ranking accuracy. RKHS regression and RankSVM also achieve good accuracy when used with an RBF kernel. Traditional regression methods such as Bayesian lasso, wBSR and BayesC were found less suitable for ranking. Pearson correlation was found to correlate poorly with NDCG. Our study suggests two important messages. First, ranking methods are a promising research direction in GS. Second, NDCG can be a useful evaluation measure for GS.
منابع مشابه
A new approach for Robot selection in manufacturing using the ellipsoid algorithm
The choice of suitable robots in manufacturing, to improve product quality and to increase productivity, is a complicated decision due to the increase in robot manufacturers and configurations. In this article, a novel approach is proposed to choose among alternatives, differently assessed by decision makers on different criteria, to make the final evaluation for decision-making. The approach i...
متن کاملEvaluation and ranking of suppliers with fuzzy DEA and PROMETHEE approach
Supplier selection is a multi-Criteria problem. This study proposes a hybrid model for supporting the suppliers’ selection and ranking. This research is a two-stage model designed to fully rank the suppliers where each supplier has multiple Inputs and Outputs. First, the supplier evaluation problem is formulated by Data Envelopment Analysis (DEA), since the regarded decision deals with uncertai...
متن کاملSupplier selection using compromise ranking and outranking methods
In today’s highly competitive manufacturing environment, an effective supplier selection process is very important for the success of any business organization. Selection of the best supplier is always a difficult task for the purchasing manager. Suppliers have varied strengths and weaknesses which require careful assessment by the purchasing manager before selecting and ranking them. Any suppl...
متن کاملThe Pattern of Linkage Disequilibrium in Livestock Genome
Linkage disequilibrium (LD) is bases of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. The Particular alleles at closed loci have a tendency to be co-inherited. In linked loci this pattern leads to association between alleles in population which is known as LD. Two metr...
متن کاملA Common Weight Data Envelopment Analysis Approach for Material Selection
Material selection is one of the major problems in manufacturing environments since the improper selected material may lead to fail in the production processes and result in customer dissatisfaction and cost inefficiency. Every material has different properties which should be considered as major criteria during the material selection procedure. Selection criteria could be quantitative or quali...
متن کاملIdentification and Ranking Green Supplier Selection Criteria Using One-Sample T-Test and FANP Methods: A Case Study for Petrochemical Industry
Increasing global notices in environmental protection, green supply chain management (GSCM) has received much attention by researchers and managers more than the past. Commonly, firms have considered cost criteria to select their suppliers. Despite the fact that there are various papers considering the formal criteria in supplier selection, there is a few limited numbers considering the environ...
متن کامل